On the von Neumann capacity of noisy quantum channels 
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We discuss the capacity of quantum channels for information transmission and storage. Quantum 
channels have dual uses: they can be used to transmit known quantum states which code for classical 
information, and they can be used in a purely quantum manner, for transmitting or storing quantum 
entanglement. We propose here a definition of the von Neumann capacity of quantum channels, 
which is a quantum mechanical extension of the Shannon capacity and reverts to it in the classical 
limit. As such, the von Neumann capacity assumes the role of a classical or quantum capacity 
depending on the usage of the channel. In analogy to the classical construction, this capacity 
is defined as the maximum von Neumann mutual entropy processed by the channel, a measure 
which reduces to the capacity for classical information transmission through quantum channels 
(the "Kholevo capacity") when known quantum states are sent. The quantum mutual entropy 
fulfills all basic requirements for a measure of information, and observes quantum data-processing 
inequalities. We also derive a quantum Fano inequality relating the quantum loss of the channel to 
the fidelity of the quantum code. The quantities introduced are calculated explicitly for the quantum 
"depolarizing" channel. The von Neumann capacity is interpreted within the context of superdense 
coding, and an "extended" Hamming bound is derived that is consistent with that capacity. 



PACS numbers: 03.65.Bz,89.70.+c 



I. INTRODUCTION 

The problem of transmission and storage of quantum 
states has received a considerable amount of attention 
recently, owing to the flurry of activity in the field of 
quantum computation [[l] sparked by Shor's discovery 
of a quantum algorithm for factoring In anticipa- 
tion of physical realizations of such computers (which 
still face major conceptual challenges), it is necessary to 
extend to the quantum regime the main results of Shan- 
non's information theory ||, which provides limits on 
how well information can be compressed, transmitted, 
and preserved. In this spirit, the quantum analogue of 
the noiseless coding theorem was obtained recently by 
Schumacher [Q. However, noisy quantum channels are 
less well understood, mainly because quantum noise is of 
a very different nature than classical noise, and the notion 
of "quantum information" is still under discussion. Yet, 
important results have been obtained concerning the cor- 
rection of errors induced by the decoherence of quantum 
bits via suitable quantum codes. These error-correcting 
codes [j5|-[l2f work on the principle that quantum informa- 
tion can be encoded in blocks of qubits (codewords) such 
that the decoherence of any qubit can be corrected by an 
appropriate code, much like the classical error-correcting 
codes. Therefore, it is expected that a generalization of 
Shannon's fundamental theorem to the quantum regime 
should exist, and efforts towards such a proof have ap- 
peared recently p3[-^5[. The capacity for the transmis- 
sion of classical information through quantum channels 
was recently obtained by Hausladen et al. ||16[| for the 
transmission of pure states, and by Kholevo |17|| for the 
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general case of mixed states. 

When discussing quantum channels, it is important to 
keep in mind that they can be used in two very differ- 
ent modes. On the one hand, one may be interested 
in the capacity of a channel to transmit or else store, 
an unknown quantum state in the presence of quantum 
noise. This mode is unlike any use of a channel we are 
accustomed to in classical theory, as strictly speaking 
classical information is not transmitted in such a use 
(no measurement is involved). Rather, such a capac- 
ity appears to be a measure of how much entanglement 
can be transmitted (or maintained) in the presence of 
noise induced by the interaction of the quantum state 
with a "depolarizing" environment. On the other hand, 
a quantum channel can be used for the transmission of 
known quantum states (classical information), and the 
resulting capacity (i.e., the classical information trans- 
mission capacity of the quantum channel) represents the 
usual bound on the rate of arbitrarily accurate informa- 
tion transmission. In this paper, we propose a definition 
for the von Neumann capacity of a quantum channel, 
which encompasses the capacity for procesing quantum 
as well as classical information. This definition is based 
on a quantum mechanical extension of the usual Shan- 
non mutual entropy to a von Neumann mutual entropy, 
which measures quantum as well as classical correlations. 
Still, a natural separation of the von Neumann capacity 
into classical and purely quantum pieces does not ap- 
pear to be straightforward. This reflects the difficulty 
in separating classical correlation from quantum entan- 
glement (the "quantum separability" problem, see, e.g., 
]18[ and references therein). It may be that there is no 
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unambiguous way to separate classical from purely quan- 
tum capacity for all channels and all noise models. The 
von Neumann capacity we propose, as it does not involve 
such a separation, conforms to a number of "axioms" 
for such a measure among which are positivity, subad- 
ditivity, concavity (convexity) in the input (output), as 
well as the data processing inequalities. We also show 
that the von Neumann capacity naturally reverts to the 
capacity for classical information transmission through 
noisy quantum channels of Kholevo |T^] (the Kholevo ca- 
pacity) if the unknown states are measured just before 
transmission, or, equivalently, if the quantum states are 
prepared. In such a use, thus, the "purely quantum piece" 
of the von Neumann capacity vanishes. We stop short of 
proving that the von Neumann capacity can be achieved 
by quantum coding, i.e., we do not prove the quantum 
equivalent of Shannon's noisy coding theorem for the to- 
tal capacity. We do, however, provide an example where 
the von Neumann capacity appears achievable: the case 
of noisy superdense coding. 

In the next section we recapitulate the treatment of 
the classical communication channel in a somewhat novel 
manner, by insisting on the deterministic nature of classi- 
cal physics with respect to the treatment of information. 
This treatment paves the way for the formal discussion 
of quantum channels along the lines of Schumacher |l3| ] 
in Section III, which results in a proposal for the def- 
inition of a von Neumann capacity for transmission of 
entanglement /correlation that parallels the classical con- 
struction. We also prove a number of properties of such 
a measure, such as subadditivity, concavity/convexity, 
forward/backward quantum data-processing inequalities, 
and derive a quantum Fano inequality relating the loss 
of entanglement in the channel to the fidelity of the code 
used to protect the quantum state. This proof uses an 
inequality of the Fano-type obtained recently by Schu- 
macher |l3[ ]. In Section IV we demonstrate that the 
von Neumann capacity reduces to the recently obtained 
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if the quantum states are known, 
kept in memory" , before sending 
them on. In Section V then we apply these results di- 
rectly to a specific example, the quantum depolarizing 
channel []l9| . This generic example allows a direct calcu- 
lation of all quantities involved. Specifically, we calculate 
the entanglement/correlation processed by the channel 
as a function of the entropy of the input and the prob- 
ability of error of the channel. We also show that this 
capacity reverts to the well-known capacity for classi- 
cal information transmission in a depolarizing channel if 
known quantum states are transmitted through the chan- 
nel. In Section VI finally, we interpret the von Neumann 
capacity in the context of superdense coding and derive 
a quantum Hamming bound consistent with it. 



II. CLASSICAL CHANNELS 

The information theory of classical channels is well 
known since Shannon's seminal work on the matter ||. 
In this section, rather than deriving any new results, we 
expose the information theory of classical channels in the 
light of the physics of information, in preparation of the 
quantum treatment of channels that follows. Physicists 
are used to classical laws of physics that are determinis- 
tic, and therefore do not consider noise to be an intrinsic 
property of channels. In other words, randomness, or 
a stochastic component, does not exist per se, but is a 
result of incomplete measurement. Thus, for a physicist 
there are no noisy channels, only incompletely monitored 
ones. As an example, consider an information transmis- 
sion channel where the sender's information is the face 
of a coin before it is flipped, and the receiver's symbol is 
the face of the coin after it is flipped. Information theory 
would classify this as a useless channel, but for a physicist 
it is just a question of knowing the initial conditions of the 
channel and the environment well enough. From this, he 
can calculate the trajectory of the coin, and by examin- 
ing the face at the received side infer the information sent 
by the sender. Classical physics, therefore, demands that 
all conditional probability distributions can be made to 
be peaked, if the environment, enlarged enough to cover 
all interacting systems, is monitored. In other words, 
Pi\j — 1 or for all i, j: if the outcome j is known, 
i can be inferred with certainty. As a consequence, all 
conditional entropies can be made to vanish for a closed 
system. 

According to this principle, let us then construct the 
classical channel. Along with the ensemble of source sym- 
bols X (symbols x\, - ■ ■ , xn appearing with probabilities 
pi, ■ ■ ■ ,pn), imagine an ensemble of received symbols Y . 
The usual noisy channel is represented by the diagram 
on the left in Fig. |l|: the conditional entropy H(X\Y) 
represents the loss L in the channel, i.e., the uncertainty 
of inferrring X from Y, whereas H (Y\ X) stands for noise 
N in the output, which is unrelated to the error-rate of 
the channel. 

FIG. 1. (a) Entropy Venn diagram for the classical channel 
XY, and its "physical" extension including the environment. 




A channel for which L = is called a "lossless" channel 
(no transmission errors occur), whereas N — charac- 
terizes a "deterministic" channel (the input unambigu- 
ously determines the output). On the right-hand side 
in Fig. fy, we have extended the channel to include the 
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environment. All conditional entropies are zero, and 
the noise and loss are simply due to correlations of the 
source or received ensembles with an environment, i.e., 
L = H{X:E\Y) and N = H{Y:E\X). The capacity of 
the classical channel is obtained by maximizing the mu- 
tual entropy between source and received symbols [the 
information / = H(X:Y) processed by the channel] over 
all input distributions: 



C 



max I 

p(x) 



(2.1) 



If the output of the channel Y is subjected to another 
channel (resulting in the output Z, say), it can be shown 
that the information processed by the combined channel, 
H(X:Z), cannot possibly be larger than the information 
processed in the first leg, H(X;Y). In other words, any 
subsequent processing of the output cannot possibly in- 
crease the transmitted information. This is expressed in 
the data-processing inequality (see, e.g., |po|): 



H{X:Z) < H{X:Y) < H(X) 



(2.2) 



On the same token, a "reverse" data-processing inequal- 
ity can be proven, which implies that the information 
processed in the second leg of the channel, H(Y:Z), must 
exceed the information processed by the total channel, 
H{X:Z); 



H{X:Z) < H(Y:Z) < H(Z) 



(2.3) 



This inequality reflects microscopic time-reversal invari- 
ance: any channel used in a forward manner can be used 
in a backward manner. 

As far as coding is concerned, the troublesome quantity 
is the loss L, while the noise N is unimportant. Indeed, 
for a message of length n, the typical number of input 
sequences for every output sequence is 2 nL , making de- 
coding impossible. The principle of error-correction is to 
embed the messages into codewords, that are chosen in 
such a way that the conditional entropy of the ensemble 
of codewords vanishes, i.e., on the level of message trans- 
mission the channel is lossless. Not surprisingly, there is 
then a relationship between the channel loss L and the 
probability of error p c of a code c that is composed of s 
codewords: 

L<H 2 \p c ]+p c log{s-l) , (2.4) 
where i?2[p] is the dyadic Shannon entropy 
H 2 [p] =H 2 [l-p] = -plogp - (1 -p)log(l -p) . (2.5) 



Eq. (2.4) is the Fano inequality (see, e.g., [|20| ), which 
implies, for example, that the loss vanishes if the error 
of the code vanishes. Note that the noise of the channel 
itself in general is not zero in this situation. Let us now 
turn to quantum channels. 



III. QUANTUM CHANNELS 



A. Information theory of entanglement 



Quantum channels have properties fundamentally dif- 
ferent from the classical channel just described owing to 
the superposition principle of quantum mechanics and 
the non-cloning theorem that ensues |^l[ . First and fore- 
most, the "input" quantum state, after interaction with 
an environment, is "lost", having become the output 
state. Any attempt at copying the quantum state be- 
fore decoherence will result in a classical channel, as we 
will see later. Thus, a joint probability for input and out- 
put symbols does not exist for quantum channels. How- 
ever, this is not essential as the quantity of interest in 
quantum communication is not the state of an isolated 
quantum system (a "product state"), but the degree of 
entanglement between one quantum system and another, 
parameterized by their mutual entropy as shown below. 
A single non-entangled quantum system (such as an iso- 
lated spin-1/2 state) carries no entropy and is of no in- 
terest for quantum communication as it can be arbitrar- 
ily recreated at any time. Entangled composite systems 
(such as Bell states) on the other hand are interesting be- 
cause the entanglement can be used for communication. 
Let us very briefly recapitulate the quantum information 
theory of entanglement |22|-p5f . 

For a composite quantum system AB, we can write 
relations between von Neumann entropies that precisely 
parallel those written by Shannon for classical entropies. 
Specifically, we can define the conditional entropy of A 
(conditional on the knowledge of B) 



S(A\B) = S{AB) - S(B) 



(3.1) 



via a suitable definition of a "conditional" density matrix 
Pa\b- The latter matrix can have eigenvalues larger than 
unity, revealing its non-classical nature and allowing con- 
ditional quantum entropies to be negative |22| | . Similarly, 
we can define a "mutual" density matrix pA-.B giving rise 
to a mutual von Neumann entropy 



S(A:B) = S(A) + S{B) - S(AB) 



(3.2) 



which exceeds the usual bound obtained for mutual Shan- 
non entropies by a factor of two: 



S(A:B) < 2mm[S{A),S(B)} 



(3.3) 



The latter equation demonstrates that quantum systems 
can be more strongly correlated than classical ones: they 
can be supercorrelated. These relations can be conve- 
niently summarized by entropy Venn diagrams (Fig. ||a) 
as is usual in classical information theory. The extension 
to the quantum regime implies that negative numbers 
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can appear which are classically forbidden 1 . As an ex- 
ample, we show in Fig. |^b the quantum entropies of Bell 
states (which are fully entangled states of two qubits). 
These notions can be extended to multipartite systems, 
and will be used throughout the paper. 

FIG. 2. (a) Entropy Venn diagram for a bipartite entangled 
quantum system AB, depicting S(AB) (total area), marginal 
entropies [S(A) viz. S(B)\, conditional [S(A\B) viz. S(B\A)] 
and mutual [S(AB)] entropies, (b) Entropy diagram for a 
fully entangled Bell-state. 



(a) 



S(A) 



S(B) 



(b) 



S(A) 



S(B) 




The degree of entanglement of a bipartite pure quan- 
tum state is customarily indicated by the marginal en- 
tropy of one of its parts, i.e., the von Neumann entropy 
of the density matrix obtained by tracing the joint den- 
sity matrix over the degrees of freedom of the other part 
(the entropy of entanglement, see jL(|). However, since 
the parts of an entangled system do not possess a state 
on their own, it takes up to twice the marginal entropy 
of one of the parts to specify (in bits) the state of en- 
tanglement. For example, it takes up to two bits to 
specify the entanglement between two qubits (there are 
four Bell-basis states) . Thus, we propose to measure the 
entanglement of pure states by the mutual entropy be- 
tween the two parts, which takes values between (for 
non-entangled systems) and 2S (for entangled systems of 
marginal entropy S each). In order to avoid confusion 
with the previously defined entropy of entanglement, we 
propose to call this quantity the mutual entanglement (or 
simply von Neumann mutual entropy), and denote it by 
the symbol Iq: 



Iq = S(A:B) 



(3.4) 



For pure entangled states, the mutual entanglement Iq 
is just twice the entropy of entanglement, demonstrating 
that either is a good measure for the degree of entangle- 
ment, but not necessarily for the absolute amount. Es- 
timating the entanglement of mixed states, on the other 
hand, is more complicated, and no satisfying definition 
is available (see []10| for the most established ones). The 
quantum mutual entropy for mixed states does not rep- 
resent pure quantum entanglement, but rather classical 
and quantum entanglement that is difficult to separate 
consistently. For reasons that become more clear in the 



following, we believe that the mutual entanglement Iq 
between two systems is the most straightforward gener- 
alization of the mutual information / of classical infor- 
mation theory, and will serve as the vehicle to define a 
quantum/classical von Neumann capacity for quantum 
channels. 



B. Explicit model 



In constructing a general quantum channel formally, 
we follow Schumacher Jl3[ ] . A quantum mixed state Q 
suffers entanglement with an environment E so as to lead 
to a new mixed state Q 1 with possibly increased or de- 
creased entropy. In order to monitor the entanglement 
transmission, the initial mixed state Q is "purified" by 
considering its entanglement with a "reference" system 
R: 



\RQ)=J2 



'Pi \n,i) 



(3.5) 



where |rj) are the R eigenstates. Indeed, this can al- 
ways be achieved via a Schmidt decomposition. Then, 
the mixed state Q is simply obtained as a partial trace 
of the pure state QR: 



Pq = TirIpqr] =^2Pi\i)(i\ 



(3.6) 



Also, the interaction with the environment 

QRE Uq ^ 1r Q'R'E 1 (3.7) 



now can be viewed as a channel to transmit the entangle- 
ment between QR to the system Q'R'. Here, Uqe is the 
unitary operation entangling QR with the environment 
E, which is initially in a pure state. This construction is 
summarized in Fig. |^. 

FIG. 3. Quantum network representation of a noisy quan- 
tum channel. R purifies the mixed state Q; the corresponding 
entanglement is indicated by a dashed line. 



R 

Q 



I0>- 




R 

Q' 

E' 



In classical entropy Venn diagrams, negative numbers can 
only appear in the mutual entropy of three or more systems. 
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The evolution of entropies in such a channel is depicted 
in Fig. |J, where the entropy of the reference state [which 
is the same as the entropy of Q before entanglement, 
S(Q) = S{R)} is denoted by S, 

S=-^ Pi logft, (3.8) 

i 

while the entropy of the quantum state Q' after entan- 
glement S(Q') — S' , and the entropy of the environment 
S(E) = S e . The latter was termed "exchange entropy" 
by Schumacher |jl3| . 

FIG. 4. Unitary transformation entangling the pure envi- 
ronment \E) with the pure system \QR). The reference sys- 
tem R is not touched by this transformation, which implies 
that no entropy can be exchanged across the double solid lines 
in the diagram on the left. 




Note that, as for any tripartite pure state, the entropy 
diagram of the entangled state Q'R'E' is uniquely fixed 
by three parameters, the marginal entropies of Q', R', 
and E' respectively, i.e., the numbers S, S", and S e . Also, 
in any pure entangled diagram involving three systems, 
the ternary mutual entropy [the center of the ternary di- 
agram, S(Q':R':E% is always zero [[|ff|. 

To make contact with the classical channel of the pre- 
vious section, let us define the quantum loss Lq 2 : 

L q = S(R':E'\Q') = S e + S - S' . (3.9) 

It represents the difference between the entropy acquired 
by the environment, S e , and the entropy change of Q, 
(S" — S), and thus stands for the loss of entanglement in 
the quantum transmission. It plays a central role in er- 
ror correction as shown below and in Section III.D. The 
entropy diagram in terms of S, S e , and Lq is depicted 
in Fig. |5| From this diagram we can immediately read 
off inequalities relating the loss Lq and the entropies S 
and S e by considering triangle inequalities for quantum 
entropies pq], namely 



< L Q < 2S , (3.10) 

< L Q < 2S e , (3.11) 

which can be combined to 

< L Q < 2min(S',S , e ) . (3.12) 



We find therefore that the initial mutual entanglement 
2S is split, through the action of the environment, into 
a piece shared with Q' [i.e., S(Q':R') = 2S — Lq], and 
a piece shared with the environment (the remaining loss 
Lq) according to the relation 

S(R':Q') + S(R':E'\Q') = S(R':E'Q') = S(R:Q) , (3.13) 
or equivalently 

Iq + L q = 2S . (3.14) 

FIG. 5. Entropy diagram summarizing the entropy rela- 
tions between the entangled systems Q' , R' , and E' . 




Finally, we are ready to propose a definition for the von 
Neumann capacity. Again, in analogy with the classical 
construction, the von Neumann capacity Cq would be 
the mutual entanglement processed by the channel (mu- 
tual von Neumann entropy) , maximized over the density 
matrix of the input channel, i.e., 

Cq = max/n , (3.15) 

PQ 

where Iq — S(R':Q') = S(R:Q') is the entanglement 
processed by the channel: 

I Q = 2S-L Q . (3.16) 

From the bound ( Jj.lOp we find that the entanglement 
processed by the channel is non-negative, and bounded 



2 We follow here the nomenclature that "quantum" always 
means "quantum including classical", rather than "purely 
quantum", in the same sense as the von Neumann entropy 
is not just a purely quantum entropy. This nomenclature is 
motivated by the difficulty to separate classical from quantum 
entanglement. 
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from above by the initial entanglement 2S. An interest- 
ing situation arises when the entanglement processed by 
the channel saturates this upper bound. This is the case 
of the lossless quantum channel, where Lq = 0. 

It was shown recently by Schumacher and Nielsen |l4| ] 
that an error-correction procedure meant to restore the 
initial quantum state (and thus the initial entanglement 
2S) can only be successful when Lq — 0. From Fig. || 
we can see that when Lq = 0, Q' is entangled separately 
with the reference state and the environment, leading to 
the diagram represented in Fig. For this reason alone 
it is possible to recover the initial entanglement between 
Q and R via interaction with an ancilla A (that can be 
viewed as a second environment in a "chained" channel). 
The latter effects a transfer of the entanglement between 
Q' and E' to entanglement between E' and A. This oper- 
ation can be viewed as an "incomplete" measurement of 
Q' by A which only measures the environment E' while 
keeping intact the entanglement of Q' with R. It was 
shown in (l4|] that Lq = is in fact a necessary and suf- 
ficient condition for this to be feasible. Such a transfer 
of entanglement corresponds to the quantum equivalent 
of error correction, and will be discussed with reference 
to the quantum Fano inequality in Section III.D. 

FIG. 6. Entanglement between Q' , R' , and E' in the loss- 
less quantum channel 

Q' R' 



( S e 


-S 1 










f 2S e 


V J 


\ 




















/ £" 



C. Axioms for quantum information 

In the following, we present a number of reasonable 
"axioms" for a quantum mutual information, and show 
that Iq defined above has the required properties. These 
are: 

(i) non-negativity 

(ii) concavity in pq (for a fixed channel) 

(iii) convexity in p'g (for fixed pq) 

(iv) subadditivity 

These requirements for a quantum mutual entropy 
( "entanglement processed by the channel" ) are very nat- 
ural and reflect the kind of requirements that are put on 
classical channels. The non-negativity of Iq is simply a 



consequence of the subadditivity of quantum entropies. 
(Just like the mutual Shannon entropy, the mutual quan- 
tum entropy is a non-negative quantity). Concavity of 
quantum information in pq [axiom (ii)] reflects that the 
information processed by a channel with a mixture of 
quantum states pq = J2i Wip l Q (with J2i w i — 1) as input 
should be larger than the average information processed 
by channels that each have a mixture p l Q as input, i.e., 

Iq(pq)>$>*Wq)- (3-17) 

i 

This is the quantum analogue of the concavity of the 
Shannon mutual information H(X:Y) in the input prob- 
ability distribution p(x) for a fixed channel, i.e., fixed 
p(y\x). The proof uses that, if the quantum operation 
achieved by the channel is fixed, we have 

Pqe = Uqe (^Wtp^^iO^UQz 

i 
i 

Therefore, using 

I Q (p Q ) = S(R:Q') 

= S(R) + S(Q') - S(RQ') 

= S(Q'E') + S(Q')-S(E') 

= SiQ'lE^ + SiQ 1 ) (3.19) 

the concavity of the quantum information in the input 
results from the concavity of S(Q'\E') in p'q E and from 
the concavity of S(Q') in p'q J27J. 

Convexity of the processed information in p'q [axiom 
(iii)] states that, if the superoperator that takes a fixed 
Pq into p'q is such that 

3 

then 

Hpq p'q) < J2 w ^q(pq -> p'q) ■ ( 3 - 21 ) 

3 

Thus, the processed information of a channel that is a 
"superposition" of channels (each used with probability 
Wj) that result in p'q cannot exceed the average of the 
information for each channel. One has a similar property 
for classical channels: the mutual information H(X:Y) is 
a convex function of p(y\x) for a fixed input distribution 
p(x). The proof follows from noting that, if the input is 
fixed, we have 



G 



Prq 



E 



W OPRQ 



(3.22) 



Then, expressing the quantum information as 
IQ(PQ Pq) = S(R:Q') = S(R) - S(R\Q') . (3.23) 

and noting that S(R) is constant, the concavity of 
S(R\Q') in p' R Q implies the convexity of the quantum 
information in the output. 

Finally, the subadditivity of quantum information [ax- 
iom (iv)] is a condition which ensures that the informa- 
tion processed by a joint channel with input Pq 1 q 2 is 
smaller or equal to the information processed "in par- 
allel" by two channels with input pq 1 — T^q 2 (pq 1 q 2 ) 
and pq 2 — ^q 1 (pq 1 q 2 ) respectively. Thus, if R is the 
reference system purifying the joint input Q1Q2, Qi is 
purified by RQ 2 while Q 2 is purified by RQ\ (see Fig. [?]). 

FIG. 7. Parallel channels as quantum network, in the 
derivation of the subadditivity of mutual von Neumann en- 
tropies. The entanglement between Q\, Q2, and the reference 
is indicated by a dashed line. 

R 
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The subadditivity of von Neumann mutual entropies 
for such a channel can be written as 

SiR-.Q'iQ'z) < S(RQ 2 :Q[) + S(RQi:Q' 2 ) , (3.24) 

which can be read as 

I12 <h+h (3.25) 

with the corresponding identifications, and mirrors the 
classical inequality 



H{XxX2-YiY 2 ) < H{Xi-Y x ) + H{X 2 :Y 2 ) 

for two independent channels taking X\ - 
X 2 ^Y 2 . 



(3.26) 
Y\ and 



To prove inequality (3.24), we rewri te th e quantum in- 
formation of each channel using Eq. ( 3.19 ) and th e fac t 
that E\ and E 2 are initially in a product state. Eq. (3.24) 
then becomes 



S{Q'ME[E' 2 ) + S{Q'M < 

S(Qil^) + S(Q[) + S(Q' 2 \E' 2 ) + S(Q' 2 ) . 

Subadditivity of conditional entropies, i.e., 

SiQiQ'zlEiE's) 

= SiQ'^EiE!,) + S(Q' 2 \E[E 2 ) - S (Q'^E'^) 

" v ' 



(3.27) 



< SiQ^E'^) + S(Q' 2 \E[E' 2 ) 

< SiQ'^Ei) - SiQ'^E',) +S(Q' 2 \E' 2 ) - S(Q' 2 :E[\E' 2 ) 



< SiQ'^Ei) + S(Q' 2 \E' 2 ) , 



(3.28) 



together with the subadditivity property of or dinar y 
(marginal) von Neumann entropies, proves Eq. ( 3.24 ). 
The terms that are ignored in the above inequality are 
positive due to strong subadditivity. This property of 
subadditivity of the information processed by quantum 
channels can be straightforwardly extended to n chan- 
nels. 

An alternative definition for the quantum information 
processed by a channel, called "coherent information" , 
has been proposed by Schumacher and Nielsen Q , and 
by Lloyd ||. This quantity I e = S(R'\E r ) = S - L Q is 
not positive axiom (i)], and violates axioms (ii) and (iv), 
which leads to a violation of the reverse data-processing 
inequality, while the "forward" one is respected (as 
opposed to the von Neumann mutual entropy which ob- 
serves both, see below). The coherent information at- 
tempts to capture the "purely" quantum piece of the 
processed information while separating out any classical 
components. This separation appears to be at the origin 
of the shortcomings mentioned above. 



D. Inequalities for quantum channels 



From the properties of the "mutual entanglement" Iq 
derived above, we can prove data-processing inequalities 
for Iq which reflect probability conservation, as well as 
the Fano inequality which relates the loss of a channel to 
the fidelity of a code. 

(i) Data-processing 

Assume that starting with the entangled state QR, 
entanglement with environment E\ produces the mixed 
state Q\. This output is used again as an input to an- 
other channel, this time entangling Q\ with E 2 to obtain 
Q 2 (see Fig. |). 

FIG. 8. Chaining of channels in the derivation of the 
data-processing inequality. The output Qi is subjected to 
a second channel by entangling with an environment E2 in- 
dependent from Ei, to give output Q2- 
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The quantum analogue of the (forward) data- 
processing inequality (2.2) that holds for mutual infor- 
mations in classical channels involves the mutual entan- 
glements S(R:Qi) and S(R:Q2), and asserts that the mu- 
tual entanglement between reference and output cannot 
be increased by any further "processing" : 



S(R:Q 2 ) < S{R:Q X ) < 2S 



(3.29) 



That such an inequality should hold is almost obvious 
from the definition of the mutual entanglement, but a 
short proof is given below. This proof essentially follows 
Rcf . |Q , and is based on the property of strong subad- 
ditivity applied to the system RE X E 2 : 

S{R:E 2 \Ei) = S(R:E X E 2 ) - S(R:Ei) > . (3.30) 

For the channel Q — » Q x , we see easily (see Fig. ||) that 

S{R:E X ) = S(R:Q X E X ) - S{R:Q X \E X ) 

= 2S-S(R:Qi) . (3.31) 

Similarly, considering E X E 2 as the environment for the 
"overall" channel Q — > Q 2 , we find 



S{R:E 1 E 2 )=2S-S(R:Q 2 ) 



(3.32) 



Pluggi ng Eq s. ( [3.31 ) and (3.32) into the positivity con- 
dition ( 3.30| ), we obtain the quantum data processing in- 
equality, Eq. (3.29), as claimed. 

The reverse quantum data-processing inequality im- 
plies that the entanglement processed by the second leg 
of the channel, S(RE X :Q 2 ), must be larger than the en- 
tanglement processed by the entire channel: 

S(R:Q 2 ) < S(RE X :Q 2 ) < S{RE X E 2 :Q 2 ) = 2S{Q 2 ) . (3.33) 

The proof relies on strong subadditivity applied to 
Q 2 E\E 2 : 

S{Q 2 :E X \E 2 ) = S(Q 2 :E X E 2 ) - S(Q 2 :E 2 ) > . (3.34) 

For treating the channel Qi — * Qi (i- c -> the "second leg" ) , 
we have to purify the input state of Q x , that is consider 
RE 1 as the "reference" . Thus, we have 



S(Q 2 :RE 1 ) = 2S(Q 2 )-S(Q 2 :E 2 ) . 
For the "overall" channel Q — > Q 2 , we have 
S(Q 2 :R) = 2S(Q 2 )-S{Q 2 :E X E 2 ) . 



(3.35) 



(3.36) 



These two last equations together with Eq. (3.34), re- 
sult in the reverse quantum data-processing inequality, 
Eq. (pi. 



From Eq. (3.29) we obtain immediately an inequality 
relating the loss of entanglement after the first stage L x 
(we drop the index Q that indicated the quantum nature 
of the loss in this discussion), with the overall loss, L X2 : 



Physically, this implies that the loss L\ 2 cannot decrease 
from simply chaining channels, just as in the classical 
case. As emphasized earlier, the loss L x corresponds to 
the share of initial entanglement that is irretrievably lost 
to the environment. Indeed, if the environment cannot 
be accessed (which is implicit by calling it an environ- 
ment) the decoherence induced by the channel cannot 
be reversed. Only if L x = can this be achieved [p"4[ . 
In view of this fact, it is natural to seek f or a quantum 
equivalent to the classical Fano inequality ([2.4|). 

(ii) Fano inequality 

To investigate this issue, let us consider the chained 
channel above, where error correction has taken place 
via transfer of entanglement with a second environment. 
Let us also recall the definition of "entanglement fidelity" 
of Schumacher |l^] , which is a measure of how faithfully 
the dynamics of the channel has preserved the initial en- 
tangled quantum state QR: 



F e (QR,Q'R) = (QR\ p Q , R \QR) = F««' 



(3.38) 



Since this entanglement fidelity does not depend on the 
reference system |13]|, we drop R from F e from here on, 
as indicated in Eq. (|3.38). 



Naturally, the entanglement fidelity can be related to 
the probability of error of the channel. The quantum 
analogue of the classical Fano inequality should relate 
the fidelity of the code (in our example above the fidelity 
between QR and Q 2 R, the error-corrected system) to the 
loss of the error-correcting channel L X2 . The derivation 
of such an inequality is immediate using the Fano-type 
inequality derived by Schumacher p3| , which relates the 
entropy of the environment of a channel S(E') to the 
fidelity of entanglement, 

S(E') < H 2 [F^'] + (1 - if «') log (d Q d R - 1) , (3.39) 

where dQ and d R are the Hilbert-space dimensions of Q 
and R respectively, and H 2 [F] is again the dyadic Shan- 
non entropy. Let us apply this inequality to an error- 
correcting channel (decoherence + error-correction), i.e., 
the chained channel considered above. In that case, the 
environment is E x E 2l and the entanglement fidelity is 
now between Q and Q 2l i.e., the fidelity of the code, and 
we obtain 

S{E X E 2 ) < H 2 [F^} + (1 - F^) log (d - 1) . (3.40) 

Here, d = d R dQ 2 can be viewed as the Hilbert space 
dimension of the code (this is more apparent in super- 
dense coding discussed in the Section VI). To derive the 
required relationship, we simply note that 



S{E X E 2 ) > L X2 /2 



(3.41) 



< L x < L 



12 



(3.37) 



[this is Eq. ( [3.11 ) applied to the composite channel] . This 
relates the fidelity of the code F®® 2 to the loss L X2) yield- 
ing the Fano inequality for a quantum code 
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L±2 < 2 [H 2 [F® Q2 ] + (1 - Ff Qa ) log (d - 1)] . (3.42) 

As we noticed throughout the construction of quantum 
channels, a factor of 2 appears also in the quantum Fano 
inequality, commensurate with the fact that the loss can 
be twice the initial entropy. Inequality ( |3.42 ) puts an up- 
per limit on the fidelity of a code for any non-vanishing 
loss L 12 . 



transmission is S (just like in the previous section), the 
mutual entropy between R and Q (as well as between 
X and Q) is also S, unlike the value 2S found in the 
quantum use. Decoherence now affects Q by entangling 
it with the environment, just like earlier. Thus, 



Pxq -* Pxq> = 



t Pi\Xi){Xi \ 



(4.3) 



IV. CLASSICAL USE OF QUANTUM CHANNEL 



In recent papers [|16|;|17| , the capacity for the transmis- 
sion of classical information through quantum channels 
has been discussed. Essentially, this capacity is equal 
to the maximal accessible information x i n the system, 
known as the Kholevo bound psj| . 

What we show in the following is that the mutual en- 
tanglement introduced in the previous section, i.e., the 
quantum mutual entropy S(R : Q') between the "deco- 
hered" quantum state Q' and the "reference" state R, 
reduces to x if t he quantum state is measured before it is 
transmitted, or, equivalently, if Q is prepared by a clas- 
sical "preparer" X. Let the system QR be "pu rified" 
again via a Schmidt decomposition as in Eq. ( |3.5|) . If wc 
measure Q in its eigenbasis we can write 



\RXQ) = ^ \ T i X i *) 



(4.1) 



where Xi are the eigenstates of X (if X is in state Xj, 
Q is in state i etc.). (Figure || summarizes the relation- 
ship between the respective entropies.) Naturally then, 
tracing over R we obtain 



PXQ 



Pi \Xi)(Xi \ <g) pi 



(4.2) 



with pi = 



and similarly for Prq. 



FIG. 9. Entanglement between Q, R, and the ancilla (or 
preparer) X after measurement of the initial state of Q by X, 
but prior to entanglement with the environment. The initial 
state of Q (before decoherence) is kept in memory, as it were, 
by X via classical correlation with Q. 




Thus, X and Q are classically correlated: each state of 
the "preparer" X represents a state of Q, or alternatively, 
X reflects (keeps in memory) the initial quantum state 
of Q. If the entropy of the quantum system Q before 



where 

p\ = Tr B [u QE (Pi ® |0)(0|) tf QE ] , (4.4) 

and we assumed again that the environment E is in a 
fixed "0" state before interacting with Q. Now our proof 
proceeds as before, only that the loss in the "classical" 
channel obeys different inequalities. The requirement 
that the entangling operation Uqe does not affect X or 
R now implies 



S(X':E'Q') = S{X:Q) = S(R:Q) = S 



(4.5) 



(see Figure IT( 



FIG. 10. Unitary transformation entangling the "preparer" 
(or alternatively, the classical "memory") X with the pure 
environment E and the quantum system Q. Neither the ref- 
erence R nor the preparer X are affected by this operation. 
As the ternary Venn diagram between Q' , E' and X' is not 
pure in this case, mutual entropy between Q' and X' can be 
shared by E' . 




A pply ing the chain rule to the left hand side of 
Eq. MM leads to 



(4.6) 



S(X':E'Q') = S(X-.Q') + S(X:E'\Q r ) 



The quantum mutual entropy between the preparer and 
the quantum state after decoherence, S(X:Q'), can be 
shown to be equal to the Kholevo bound x ( see Ref. p9f). 
With L = S{X:E'\Q') (the classical loss of the channel) 
we thus conclude from Eqs. (4.6) and (|4.5|) that 



S = X + L. (4.7) 

Note that S(X:Q') is equal to S(R:Q'), the mutual en- 
tanglement Iq introduced earlier, as S(X) = S(R) and 
S(XQ') = S(RQ'). Thus, 



I Q = S(R :Q') = x 



(4.8) 







if known quantum states are sent through the channel, as 
advertised. It was shown recently by Kholevo JItJ that 
the maximum of the latter quantity indeed plays the role 
of channel capacity for classical information transmission 



C = max 

Pi 



max x 

Pi 



(4.9) 



where {pi} is a probability distribution of symbols at 
the source, and p\ are the (not necessarily orthogonal) 
quantum states received at the output, with the prob- 
ability distribution {p{\ and p' = J^iPiPi- Thus, the 
quantity Cq that we propose as a capacity for entan- 
glement/correlation transmission reverts to the capacity 
for information transmission C if the unknown quantum 
states are measured before transmission. This represents 
solid evidence in favor of our interpretation. 

Let us now calculate the quantities introduced here for 
a specific simple model of quantum noise. 



V. QUANTUM DEPOLARIZING CHANNEL 

The quantum depolarizing channel is an idealization 
of a quantum storage and transmission process in which 
the stored quantum state can undergo bit-flip and phase 
errors. This is not the most general one-qubit channel 3 , 
but appears to be sufficient to examine a number of in- 
teresting aspects of quantum communication. 



A. Quantum use 



Imagine a quantum state 

|*) =a\O)+0\l) 



(5.1) 



where the basis states of the qubit can be taken to be 
spin-1/2 states polarized in the z-direction, for example. 
(Specifically, we use the convention <r z \l) = |1).) The de- 
polarizing channel is constructed in such a way that, due 
to an interaction with an environment, the quantum state 
survives with probability 1 — p, but is depolarized with 
probability p/3 by either a pure bit-flip, a pure phase- 
error, or a combination of both: 



*> 


1-p 


1*), 




*> 


p/3 


Oil*) = 


a|l)+/3|0) , 


*> 


p/3 


Oil*) = 


-a\O)+0\l) , 


*> 


p/3 




= -a|l) + /3|0) 



where the a are Pauli matrices. Such an "arbitrary" 
quantum state \& can, without loss of generality, consid- 
ered to be a state Q that is entangled with a reference 
state i?, such that the marginal density matrix of Q can 
be written as 



(5.3) 



with entropy S(pq) = — TrpglogpQ = H2[q] and q a 
probability (0 < q < 1). In other words, the coefficients 
a and f3 need not be complex numbers. Conversely, we 
can start with such a mixed state at the input, and con- 
sider QR as a pure quantum state that this mixed state 
obtains from. For example, 



\QR) = Vl^ffllO) - v?|oi> 



(5.4) 



Naturally then, the mixed state Eq. (|3Jj) is obtained by 
simply tracing over this r efer ence state. Pure states with 
real coefficients such as (5.4) are not general, but suffice 
for the depolarizing channel as R is always traced over. 

Let us now construct a basis for QR that interpolates 
between completely independent and completely entan- 
gled states, and allows us to choose the initial entropy 
of Q with a single parameter q. We thus introduce the 
orthonormal "g-basis" states 



!$-(<?)) = VWloo) 
!$+(<?)) = ^100) + V± 

\*-{q)) = VWIIO) - y/q\0l) , 



g|ii) 



\* + (q)) = V?|io) + vT^loi) 



(5.5) 



Note that for q = or 1, these states are product states, 
while for q — 1/2 they are completely entangled, and 
* ± (l/2) and <f> ± (l/2) are just the usual Bell basis states. 
The possibility of quantum decoherence of these states is 
introduced by entangling them with an environment in a 
pure state, taken to be of the same Hilbcrt space dimen- 
sion as QR for simplicity, i.e., a four-dimensional space 
for the case at hand. This is the minimal realization of a 
depolarizing channel. 

Let us assume that QR (for definitcness) is initially in 
the state \^~(q)}, and the environment in a superposi- 
tion 



\E) = y/T=j\*-(q)) 

+ vp73 (!$-(<?)>- 



|$+(g)> + |* + («)» . (5.6) 



(5.2) 



The environment and QR are then entangled by means 
of the unitary operator Uqre — Uqe <S> Ir, with 



3 A more general depolarizing channel could be constructed 
by allowing each of the possible errors a different probability. 
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U QE = 1 ® Ptf- (<?) + ( 
+ (— icry) ® -P$+(q) 



(5.7) 



where the P$(q) and P^(q) stand for projectors project- 
ing onto q-basis states. Note that the Pauli matrices act 
only on the first bit of the g-basis states, i.e., the entan- 
glement operation only involves Q and E. Depending on 
the entanglement between Q and R, however, this op- 
eration also affects the entanglement between R and E. 
Thus, we obtain the state 

\Q'R'E') = U QRE \QR)\E) = 



*e(<i)) + Vp/z{\®qr(<i) ,$5(?)>+ 



on account of the relations 



-,+ 

QR 



QR\ 

^or(q)) = \Kr( 1 ~i)) 



(5.8) 

(5.9) 
(5.10) 
(5.11) 



with H[p\, ...,p4] the Shannon entropy, and 
A = [(1 - 2p/3) 2 - 16/3p(l - p) q(l - q)] 



1/2 



(5.14) 



By tracing over the reference state we obtain the density 
matrix of the quantum system after the interaction pqi , 
and its respective entropy 

S'(p,q) = S(Q')=H 2 [q+^-(l-2q)} . (5.15) 

Together with the entropy of the reference state (which 
is unchanged since R was not touched by the interac- 
tion), S(R') = S{R) = H 2 [q], this is enough to fill in the 
ternary entropy diagram reflecting the dynamics of the 
channel, Fig. |5[ We thus find the mutual entanglement 
processed by the channel: 

I Q = S(Q'-.R) = 2H 2 [q] - L Q (p, q) , (5.16) 

where the loss is 

L Q (p, q) = H 2 [q] -H 2 [q + ^(l- 2q)] + S e (p, q) . (5.17) 



The mutual entanglement is plotted in Fig. |TT|, as a func- 
tion of the error probability p of the channel and of the 
parameter q which determines the initial entropy. 

FIG. 11. Mutual entanglement between the depolarized 
state Q' and the reference system R' = R, as a function of 
error p and parameter q. Note that the channel is 100% de- 
polarizing at p = 3/4. The concavity in q [according to axiom 
(ii)] as well as the convexity in p [axiom (iii)] are apparent. 



S(Q' 



and with obvious notation to distinguish the environment 
(E) and quantum system (QR) basis states. The (par- 
tially depolarized) density matrix for the quantum sys- 
tem is obtained by tracing over the environment: 

PQ'R' = Tr B {\Q'R'E')(Q'R'E'\) = (1 - p) P»- (q) + 

p/3[P3,-(q)+Ps,+ (l-q)+Py + (l-q)] . (5.12) 

Its eigenvalues can be obtained to calculate the entropy: 

S e {p,q) = S{Q'R') = 

,i(l-! + A),i(l-f-A)], (5.13) 




The mutual entanglement is maximal when the entropy 
of the source is maximal (as in the classical theory), i.e., 
3 = 1/2. Then: 



Cq = max Ir 



2-5 e (p,l/2) = 2-P 2 [p]-plog3 



(5.18) 



In that case, the maximal rate of entanglement transfer 
is 2 bits (error-free transfer, p = 0). The capacity only 
vanishes at p = 3/4, i.e., the 100% depolarizing chan- 
nel. This is analogous to the vanishing of the classical 
capacity of the binary symmetric channel at p = 1/2. 
As an example of such a channel, we shall discuss the 
transmission of the entanglement present in a Bell state 
(one out of four fully entangled qubit pairs) through a 
"superdense coding" channel in Section VI. A. The max- 
imal mut ual entanglement and minimal loss implied by 
Eq. (|5.18 ) are plotted in Fig. [l2] as a function of p. This 
error rate p can be related to the fidelity of the channel 
by 



F Q'Q = ! _ , 



| (1 - 2<Z) 2 



(5.19) 



where ® is Schumacher's fidelity of entanglement in- 
troduced earlier. Note that this implies that the Fano 
inequality Eq. ( 3.39| ) is saturated at q = 1/2 for any p. 
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FIG. 12. Maximal entanglement transfer C(p) and minimal 
loss L(p) as a function of the error probability p. 




B. Classical use 

Now, instead of using the channel to transmit entan- 
glement (sending unknown quantum states), one could 
equally well use it to send classical information (known 
quantum states) as outlined in section IV. Here, we cal- 
culate the capacity for the transmission of classical infor- 
mation through the quantum depolarizing channel and 
verify that the result is equal to the value obtained by 
Calderbank and Shor [0] using the Kholevo theorem. 

Before entanglement with the environment, let us then 
measure the mixed state Q via an ancilla X, after which 
Q and X are classically correlated, with mutual entropy 
H 2 [q\. Note that this operation leads to an entangled 
triplet QRX at the outset, as in Fig. ^, with S — H 2 [q\. 
We now proceed with the calculation as before. The basis 
states for the system \QXR) are then simply 



l*x(9)) = VWlooo) - V?|lll) , 
|#£(g))=V?|ooo) + v / W|in) , 

l*x(<?)> = VT=q\110) - V«|001) , 



(5.20) 



where we used the index X on the basis states to dis- 
tinguish them from the two-qubit basis states introduced 
earlier. The entanglement operation is as before, with a 
unitary operator acting on Q and E only. Because of the 
additional trace over the ancilla X, however, we now find 
for the density matrix pq'r>: 

p Q , w = (l-2p/3)[(l-g)|10)(10| + #1)<01|] 

+ 2p/3[(l-g)|QO)(00| + g|H)<H|] . (5.21) 

Consequently, we find for the mutual information trans- 
mitted through the channel 



I = S(Q':R)=H 2 [q]~L(p,q) , 
with the (classical) loss of information 



(5.22) 



L(p, q) = - q), |g, (1 - |)(1 - q), (1 - |)g] 



-H 2 [q+^-(l-2q)] 



(5.23) 



Maximizing over the input distribution as before, we ob- 
tain 



C = maxS(Q':R) 
q 



l-ff 2 [2p/3] 



(5.24) 



the result derived recently for the depolarizing channel 
simp ly fro m using the Kholevo theorem f|. Note that 
Eq. (|5.24 ) is just the Shannon capacity of a binary sym- 
metric channel [^0|, with a bit-flip probability of 2p/3 
(of the three quantum error "syndromes" , only two are 
classically detectable as bit-flips). 



VI. INTERPRETATION 

A. Quantum capacity and superdense coding 

The interpretation of the capacity suggested here as a 
quantum mechanical extension of the classical construc- 
tion can be illustrated in an intuitive manner with the 
example of the depolarizing channel introduced above. 
The idea is that Iq reflects the capacity for transmission 
of quantum mutual entropy (entanglement and/or clas- 
sical information) but that the amount transferred in a 
particular channel depends on how this channel is used. 
A particularly elegant channel that uses Iq to its full ex- 
tent is the noisy "superdense coding" channel. There, 
the entanglement between sender and receiver is used to 
transmit two bits of classical information by sending just 
one quantum bit [ p^2|j22]| . In a general superdense coding 
scheme, the initial state QR is one of a set of entangled 
states conditionally on classical bits C . This situation 
can be related to our previous discussion by noting that 
all entropies appearing there are to be understood as con- 
ditional on the classical bits C that are to be sent through 
the channel as shown in Fig. [l^. The von Neumann ca- 
pacity introduced above is then just 



I Q = S(R : Q'\C) 



(6.1) 



It is not immediately obvious that this von Neumann ca- 
pacity is equal to the classical capacity between preparer 
(usually termed Alice) and the receiver (Bob). How- 
ever, it is not difficult to prove [using the fact that 



4 That the quantum mutual entropy between a preparer and 
a quantum system is an upper bound to the amount of classi- 
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S(R:Q) = S{R:C) = S{Q:C) = 0] that Eq. is in 

fact equal to the maximal amount of classical informa- 
tion about C extractable from RQ' (after Q decohered), 
which is 4 



X = S(RQ' : C) 



(6.2) 



Thus, in this example the amount of entanglement pro- 
cessed in a channel can be viewed as the amount of clas- 
sical information about the "preparer" of the entangled 
state QR. This amount of information can reach twice 
the entropy of Q (2 bits in standard superdense coding), 
which is classically impossible. (The superdense coding 
and teleportation channels will be discussed in detail else- 
where) . 

FIG. 13. Quantum Venn diagram for the noisy superdense 
coding channel before decoherence. Conditionally on the 
classical bits C, QR is in a pure entangled state described 
by a Venn diagram of the form ( — S,2S,—S). Note that 
no information about C is contained in R or Q alone, i.e., 
S(C:R) = S(C:Q) = 0. 




Having established this relation between superdense 
coding and the general quantum channels treated here, 
let us imagine that the qubit that is sent through the 
channel (and which is "loaded" with entanglement) is 
subject to the depolarizing noise of the previous section. 
Indeed, if p = the two classical bits can be decoded 
perfectly, achieving the value of the capacity. It has been 
argued recently (2^] that this can be understood by re- 
alizing that besides the qubit that is sent forwards in 
time in the channel, the entanglement between sender 
and receiver can be viewed as an antiqubit sent backwards 
in time (which is equivalent to a qubit sent forwards in 
time if the appropriate operations are performed on it in 
the future). Thus, the quantum mechanics of superdense 
coding allows for the time-delayed (error-free) transmis- 
sion of information, which shows up as excessive capacity 
of the respective channel. On the other hand, it is known 
that (for un-encoded qubits) superdense coding becomes 
impossible if p 0.189, which happens to be the precise 
point at which Iq = 1. This is related to the fact that 
at this point the "purification" of "noisy" pairs becomes 
impossible. However, the capacity of this channel is not 



zero. While no information can be retrieved "from the 
past" in this case, the single qubit that is sent through the 
channel still carries information, indeed, it shares one bit 
of mutual entropy with the qubit stored by the receiver. 
Clearly, this is still a quantum channel: if it were classi- 
cal, the transmission of one bit could not take place with 
unit rate and perfect reliability, due to the noise level 
p = 0.189. As the receiver possesses both this particle 
and the one that was shared earlier, he can perform joint 
measurements (in the space Q'R) to retrieve at least one 
of the two classical bits. 

An extreme example is the "dephasing" channel, which 
is a depolarizing channel with only cr z -type errors, affect- 
ing the phase of the qubit. As is well known, classical bits 
are unaffected by this type of noise, while quantum super- 
positions are "dephased". The channel becomes useless 
(for the storage of superpositions) at p = 0.5, yet mea- 
suring the qubit yields one classical bit in an error-free 
manner. A calculation of max g S(R : Q') for this channel 
indeed yields 



I Q (p) = 2-H 2 \p] 



(6.3) 



In this limiting case thus, it appears possible to separate 
the classical (/ = 1) from the purely quantum capacity. 
However, it might well be possible that this cannot be 
achieved in general. Below, we show that such an "ex- 
cessive" von Neumann capacity (as in superdense coding) 
is consistent with a commensurate quantum Hamming 
bound. 



B. Quantum Hamming bounds 

Classically, the Hamming bound |2C[ | is an upper bound 
on the number s of codewords (bit-strings of length n) 
for a code to correct t errors: 



E 



< T 



(6.4) 



This is a necessary (but not sufficient) condition for error- 
free coding, which reflects the necessary space to accom- 
modate all the codewords and associated descendants for 
all error syndromes. For s codewords coding for k bits 
(s = 2*), we can consider the asymptotics of (^4) in the 
limit of infinitely long messages (n — > oo), and find that 
the rate of error-free transmission is limited by 



pn 

R< — logV 



, i , 



,2/ 



(6.5) 



cal information obtainable by measuring the quantum system 
(the Kholevo bound) is shown in Ref. bsj. 
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where R = k/n is the transmission rate and p — t/n is 
the asymptotic probability of error. Using 



lim 

n — >oo 



p log - + (1-p) log- — - 
r 1 — r 

H(p, 1 - p || r, 1 - r) , 



(6.6) 



where 77 (p, 1 — p |j r, 1 — r) is the relative entropy between 
the probability distributions p and r, we can write 



R < H(p, 1 - p || 1/2, 1/2) - 1 - H 2 (p) 



(6.7) 



The relative entropy thus turns out to be just the classical 
capacity of the channel, and measures the "distance" of 
the error-probability of the channel relative to the "worst 
case", i.e., p = 1/2. Note that relative entropies are pos- 
itive semi-definite. 

For quantum channels, the standard quantum Ham- 
ming bound for non-degenerate (orthogonal) codes is 
written as iH-llC 



t 

2 k 3 s 

i=0 



< 2" 



(6.8) 



which expresses that the number of orthogonal states 
identifying the error syndromes on the 2 k different mes- 
sages must be smaller than 2", the dimension of the 
Hilbert space of the quantum state Q (n qubits). In 
the limit of large n, this translates into an upper bound 
for the rate of non-degenerate quantum codes 



f pn 

i?<--log £ 

I 8=0 



- 1 



(6.9) 



which can (as in the classical case) be written in terms 
of a relative entropy 

i?<#(p,l-p|| 3/4,1/4) - 1 = 1-S e (p), (6.10) 

Thus, the usual quantum Hamming bound limits the rate 
of non-degenerate quantum codes by the capacity based 
on "coherent information" proposed in |lj,|l^], which is 
thought of as the "purely quantum" piece of the capacity. 
Note that the positivity of relative entropy does not in 
this case guarantee such a capacity to be positive, which 
may just be a reflection of the "inseparability" of the von 
Neumann capacity. 

The quantum Hamming bound shown above relies on 
coding the error syndromes only into the quantum state 
Q that is processed, or, in the case of superdense coding, 
sent through the noisy channel. As we noted earlier, how- 
ever, a quantum system that is entangled does not, as a 
matter of principle, have a state on its own. Thus, the 
entangled reference system R necessarily becomes part of 



the quantum system, even if it is not subject to decoher- 
ence. Thus, the Hilbert space available for "coding" au- 
tomatically becomes as large as 2n, the combined Hilbert 
space of Q and R. This is most obvious again in super- 
dense coding, where the "decoding" of the information 
explicitly involves joint measurements of the decohered 
Q' and the "reference" R, shared between sender and 
receiver (in a noise- free manner). The corresponding en- 
tanglement quantum Hamming bound therefore can be 
written by remarking that while the coding space is 2n, 
only n qubits are sent through the channel, and thus 



i=0 



2 k J2 y ( n ) ^ 22 " • 



(6.11) 



Proceeding as before, the rate of such quantum codes is 
limited by 



i?<tf(p,l-p||3/4,l/4) = 2-S e [p) , 



(6.12) 



the von Neumann capacity Cq for the depolarizing chan- 
nel proposed in this paper, Eqs. ( |3 . 1 5 ) and ( |5.18 ). The 
latter is always positive, and represents the "distance" 
between the error probability p of the channel and the 
worst-case error p = 3/4 (corresponding to a 100% de- 
polarizing channel) , in p erfect analogy with the classical 
construction. Eq. ( |6.12 ) thus guarantees the weak con- 
verse of the quantum fundamental theorem: that no code 
can be constructed that maintains a rate larger than the 
capacity Cq with a fidelity arbitrarily close to one. 



VII. CONCLUSIONS 

We have shown that the classical concept of informa- 
tion transmission capacity can be extended to the quan- 
tum regime by defining a von Neumann capacity as the 
maximum mutual von Neumann entropy between the de- 
cohered quantum system and its reference. This mu- 
tual von Neumann entropy, that describes the amount 
of information — classical and/or quantum — processed by 
the channel, obeys "axioms" that any measure of infor- 
mation should conform to. As for any quantum exten- 
sion, the von Neumann capacity reverts to its classical 
counterpart when the information is "classicized" (i.e., it 
reverts to the Kholevo capacity when measured or pre- 
pared states are sent), and ultimately to the Shannon ca- 
pacity if all quantum aspects of the channel are ignored 
(i.e., if orthogonal states are sent and measured). Thus, 
the von Neumann capacity of a channel can only vanish 
when the classical capacity is also zero, but it can be 
excessive as entanglement allows for superdense coding. 
In order to take advantage of this, however, both the 
quantum system that decoheres and the reference sys- 
tem it is entangled with need to be accessible. In practi- 
cal quantum channels this appears to be impossible, and 
the rate of practical codes must then be considerably 
smaller than the von Neumann capacity. Yet, because of 
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the inseparability of entangled states, a consistent defi- 
nition of channel capacity has to take into account the 
full Hilbert space of the state. Whether a capacity can 
be defined consistently that characterizes the "purely" 
quantum component of a channel is still an open ques- 
tion. 
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